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Abstract 

We revisit the traveling salesman problem with neighborhoods (TSPN) and obtain several 
approximation algorithms. These constitute cither improvements over previously best approx- 
imations achievable in comparable times (for unit disks in the plane), or first approximations 
ever (for hyperplanes and lines in R rf , and unit balls in M 3 ). 

(I) Given a set of n hyperplanes in R , a TSP tour that is at most O(l) times longer than 
the optimal can be computed in 0(n) time, when d is constant. 

(II) Given a set of n lines in R d , a TSP tour that is at most 0(log 3 n) times longer than the 
optimal can be computed in polynomial time, when d is constant. 

(III) Given a set of n unit disks in the plane or n unit balls in R 3 , we improve the approxi- 
mation ratios relying on a black box that computes a good approximate tour for a set of points 
in the ambient space (in our case, these are the centers of a subset of the disks or the balls). 

Keywords: Traveling salesman, group Steiner tree, linear programming, minimum-perimeter 
rectangular box, approximation algorithm, lines, planes, hyperplanes, unit disks and balls. 

1 Introduction 

In the Euclidean Traveling Salesman Problem (ETSP), given a set of points in the plane (or in 
higher-dimensional Euclidean space), one seeks a shortest tour (closed curve) that visits each point. 
In the TSP with neighborhoods (TSPN) , first studied by Arkin and Hassin [2] , each point is replaced 
by a (possibly disconnected) region. The tour must visit at least one point in each of the given 
regions (i.e., it must intersect each region). A tour for a set of neighborhoods is also referred to 
as a TSP tour. Since the Euclidean TSP is known to be NP-hard in every dimension |22[ [2"5[ I41j . 
TSPN is also NP-hard. TSP is recognized as one of the corner-stone problems in combinatorial 
optimization. See |36[ I37j for a list of related problems in geometric network optimization. 



Related work. It is known that the Euclidean TSP admits a polynomial-time approximation 
scheme in M. d , where d = 0(1), due to Arora [3 J and Mitchell |35j . Subsequent running time 
improvements have been obtained by Rao and Smith [33]. In contrast, the TSP with neighborhoods 
has been harder to approximate, at least so far. Certain instances are known to be APX-hard. 
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Arkin and Hassin [2] gave constant-factor approximations for translates of a convex region, 
translates of a connected region, and more generally, for regions with diameters parallel to a common 
direction, and with the ratio between the longest and the shortest diameter bounded by a constant. 

For n connected (possibly overlapping) neighborhoods in the plane, TSPN can be approximated 
with ratio O(logn) by the algorithm of Mata and Mitchell [31]. See also the survey by Bern and 
Eppstein [4] for a short outline of this algorithm. Subsequent running time improvements have 
been offered by Elbassioni et al. [T7] and by Gudmundsson and Levcopoulos [25]. At its core, 
the 0(log n)-approximation relies on the following early result by Levcopoulos and Lingas [30] : 
Every (simple) rectilinear polygon P with n vertices, r of which are reflex, can be partitioned into 
rectangles of total perimeter per(P) logr in 0(n log n) time. 

The first PTAS for TSPN (beyond the PTAS for points) was obtained for fat regions of about 
the same size and of bounded depth in the plane by Dumitrescu and Mitchell [15]. In particular, 
this includes the case of pairwise-disjoint unit disks. For the related minimum corridor connection 
problem, Bodlaender et al. [6] present a PTAS for the case of fat rooms of comparable size (based 
on Arora's PTAS for Euclidean TSP), and give a constant-factor approximation algorithm for the 
case of fat rooms of varying sizes. 

Using an approximation algorithm due to Slavik [46] for Euclidean group TSP (see below), 
de Berg et al. [IT] obtain constant-factor approximations for disjoint fat convex regions in the 
plane, not necessarily of comparable size. Elbassioni et al. [16J improve the runtime of the ap- 
proximation algorithm. Subsequently, Elbassioni et al. |17t [18] give constant-factor approximations 
for (possibly intersecting) fat convex regions of comparable size. Mitchell [38] gave a PTAS for 
bounded depth fat regions (of arbitrary sizes) in the plane; in particular for disjoint fat regions 
in the plane. Recently, Mitchell [39J obtained constant-factor approximations for pairwise-disjoint 
connected neighborhoods of any size or shape (not necessarily convex). In another recent develop- 
ment, Chan and Elbassioni [8] gave a quasipolynomial-time approximation scheme for fat, weakly 
disjoint regions in M. d , for d = 0(1) (which also works in metric spaces with bounded doubling 
dimension) . 

Finally, interesting variants are those with unbounded neighborhoods, such as lines or planes. 
TSPN for n lines in the plane can be solved optimally in 0(n 5 ) time [7] |4"T] |4"8] (see also [29]), 
and a 1.28- approximation can be computed in 0{n) time [13J. In contrast, TSPN for lines in R 3 is 
NP-hard. The status of TSPN for planes in M 3 appears to be unknown. 

Regarding the degree of approximation achievable, TSPN with arbitrary neighborhoods is gen- 
erally APX-hard [11} |4"5] , and it remains so even for segments of nearly the same length |17] . For 
instance, approximating TSPN for connected regions in the plane within a factor smaller than 2 is 
intractable (NP-hard) [45] , The problem is also APX-hard for disconnected regions [45] . the sim- 
plest case being point-pair regions |12] , It is conjectured that approximating TSPN for disconnected 
regions in the plane within a 0(log 1//2 n) factor is intractable [45] . Similarly, it is conjectured that 
approximating TSPN for connected regions in M 3 within a C^log 1 / 2 n) factor and for disconnected 
regions in M 3 within a 0(log 2 / 3 n) factor [45] are intractable. Moreover, proving these conjectures 
seems to require advances in complexity, rather than geometry. 

Our results. In this paper we present several improved approximation algorithms for TSPN, for 
three types of neighborhoods: (i) hyperplanes in R d ; (ii) lines in M. d ; (iii) congruent disks in the 
plane and congruent balls in M 3 . Our results are summarized in Table [1] 

We start with hyperplanes in M. d . No approximation algorithm was known for this type of 
neighborhoods. For constant d, we can compute constant-factor approximations in linear time. 
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Theorem 1. Given a set of n hyperplanes in R , and e > 0, a TSP tour that is at most 
2 d ~ 3 ^i(l + e) times longer than the optimal can be computed in 0(de 1 ~ d n) time. In particu- 
lar for d = 3, a TSP tour that is at most 2.31 times longer than the optimal can be computed in 
0(n) time. 

We continue with lines in R d , a problem much harder to deal with. Note that an instance with 
parallel lines reduces to an instance of the Euclidean TSP for points in one dimension lower (namely 
the points of intersection between the given lines orthogonal to a hperplane). Here we obtain the 
first approximations. 

Theorem 2. Given a set of n lines in R d , where d is a constant positive integer, a TSP tour that 
is at most 0(log 3 n) times longer than the optimal can be computed in polynomial time. 

For n unit disks in the plane (resp., unit balls in R 3 ), we give constant-factor approximations 
by using a black box that computes a good tour of at most n points (the centers of a suitable 
subset of disks, resp., balls). While for disjoint unit disks in R 2 the existence of a PTAS has been 
established a decade ago |15j . it has remained open for disjoint unit balls in R 3 . The constant-factor 
approximations we give here reduce one instance of TSPN with disks (or balls) to one with points, 
and thus lead to faster and also conceptually simpler algorithms. For unit disks we obtain an 
improved approximation factor 6.75; the previous best ratio is 7.62, that also holds for translates of 
a convex region [15]. Let T(n,d,e) denote the running time for computing a (1 + e)-approximation 
of an optimal tour of n points in R rf . 

Theorem 3. Given a set of n unit disks in the plane, and e > 0, a TSP tour that is at most 
^| + ^r^J + times longer than the optimal, apart from an additive constant, can be computed 
in time 0{T(n, 2, 1.8 e)). In particular, a TSP tour that is at most 6.75 times longer than the 
optimal can be computed in time 0(T(n, 2, 0.0018)). Alternatively, a TSP tour that is at most 8.52 
times longer than the optimal can be computed in time 0(n 3 / 2 log 5 n). 

For congruent balls in R 3 we give the first explicit constant approximation factor, not in the 
O(l) form. 

Theorem 4. Given a set of n unit balls in R 3 , and e > 0, a TSP tour that is at most 54\/3(l + e) 
times longer than the optimal, apart from an additive constant, can be computed in time 0(T(n, 3, e)). 
In particular, a TSP tour that is at most 100.61 times longer than the optimal can be computed in 
time 0(T(n, 3, 0.01)). Alternatively, a TSP tour that is at most 104.1 times longer than the optimal 
can be computed in time 0(n 3 ). 

Preliminaries. Let TZ be a set of regions in R rf , d > 2. A set E C R rf intersects TZ if S intersects 
each region in 1Z, that is, S n r ^ 0, Vr G TZ. A shortest TSP tour for a set 7Z of regions 
(neighborhoods) is a shortest closed curve in the ambient space that intersects TZ. 

We use the following terms and notations. The shortest TSP tours for a set of disks (or balls) 
V, set of lines C, and set of hyperplanes H, are denoted by OPT(D), OPT(£) and OPT(H), 
respectively. The Euclidean length of a curve 7 is denoted by len(7), or just I7I when there is no 
danger of confusion. Similarly, the total (Euclidean) length of the edges of a geometric graph G or 
a polygon P is denoted by len(G) and per(P), respectively. For a hyperrectangle (rectangular box) 
Q in with sides w\, . . . , Wd, the total edge length per(Q) = 2 d_1 Ylt=i w i ^ s called its perimeter. 
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Region type 


Old ratio 


New ratio 


NP-hard 


Hyperplanes in M. d 


— 


(l + s)(dVa + d- 1 l 2 )2 d -' i 


? 


Planes in R 6 


— 


2.31 in 0(n) time 


? 


Lines in M d 




(3(iog 3 n) 


yes 


Disjoint unit disks in the plane 


3.55 




yes 


Unit disks in the plane 


7.62 


6.75 


yes 


Disjoint unit balls in M 3 




7.01 


yes 


Unit balls in R 3 




100.61 


yes 



Table 1: Old and new (asymptotic) approximation ratios obtained in polynomial time. The ratios in rows 
3-6 (for disks and balls) are obtained by using the black box PTAS for computing point tours. Disjoint unit 
disks admit a PTAS [15] but its time complexity is higher than that of the PTAS for points. The old ratios 
reported in column 2 are from [15] . 

For a > 1, we say that an approximation algorithm (for TSPN) has ratio a if its output 
tour ALG satisfies len(ALG) < alen(OPT), and has asymptotic ratio a if its output satisfies 
len(ALG) < a len(OPT) + /3 for some constant (3 > 0. 

The convex hull of a set A C M. d is denoted by conv(A). The Cartesian coordinates of a point 
p G M. d are denoted by x\(p), . . . , x d (p). For a line segment s G M 3 , Ai(s), . . . , Arf(s) denote (the 
lengths of) its projections on the d coordinate axes. 

2 TSPN for hyperplanes in R d 

In this section, we present a constant factor approximation algorithm in M d with ratio (l+e)2 d ~ 3 ^j 
in 0(n) time, for constant d and e > 0. For dimensions d = 3,4,5, we achieve a better approxima- 
tion ratio. In particular, for e = 0.0002, we get the approximation ratio 2.31 in IR 3 , 5.001 in M 4 , 
and 10.74 in M 5 . 

Our algorithm is based on solving low-dimensional linear programs; it combines ideas from |13|. 
[T4l [To] [29] . We show below (Lemma [2]) that any closed curve 7 C M rf is contained in a rectangular 

box of edge lengths u>i, . . . , Wd such that Ya=i w i — 2 l en (7)- We apply this result for the optimal 
tour OPT('P). Then we use linear programming to compute a (1 + e)-approximation for the 
minimum-perimeter rectangular box intersecting %. 

Let Q be rectangular box in M, d such that the d extents of Q are w\ < W2 < • • • < w^. It is not 
difficult to see (by induction on d) that Q admits a Hamiltonian cycle of total length 

d 

T (Q) = 2 d " V + 2 d ~ 2 w 2 + ■■■ + 2w d ^ x + 2w d = w d + Y^ 2 d ~ j Wj. 

The orientation of a rectangular box Q in M. d is given by an orthonormal basis whose vectors 
are parallel to the edges of Q. Select a set A = {ai, . . . ,a m } of m = 0(e l ~ d ) orientations that 
cover all possible orientations within an error of e/{d — 1). That is, for any orientation a, there is 
one in the discrete set A so that the angle difference between the corresponding vectors is at most 
e/(d — 1). We may assume, without loss of generality, that no vector in any orthonormal basis in 
A is parallel to the coordinate axes x%, . . . x d - 
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Algorithm Al. 

Step 1: Let m = 0(e 1 ^ d ). For each i = 1, . . . , m, compute a minimum-perimeter rectangular 
box Qi with orientation ajj that intersects 

Step 2: Let Q be a box with the minimum perimeter over all m directions, found above. 
Return a Hamiltonian cycle of the 2 d vertices of Q, of length r(Q), as depicted in Fig. Q] (right). 
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Figure 1: Left: An axis-aligned rectangular box Q. Right: a Hamiltonian cycle (in bold lines) of length 
21 + 2w + Ah of the vertices of Q that visits all planes intersecting Q. 

The key observation is the following. 

Observation 1. If 7 is an intersecting polygon ofH and Q is a rectangular box containing 7, then 
any Hamiltonian cycle of the vertices of Q is also an intersecting polygon ofV. 

We compute the box Qi by linear programming. By a suitable rotation of the set H of hy- 
perplanes in each iteration i > 1, the box Qi is axis-aligned. This can be obtained in 0(n) time 
per iteration. For a plane a, let u(a) denote the unit vector orthogonal to a with a positive 
coordinate. An axis-aligned rectangular box in W d has 2 d ~ 1 antipodal pairs of vertices, which we 
denote by and U, for i = 1, ...,2 d ~ 1 , such that the vector Sjtj has a positive x^-coordinate. 
Partition % into 2 d ~ 1 types based on the following rule (ties are broken arbitrarily): 

• er € T~L is of type i, i G {1, . . . , 2 d_1 }, if the -u(cr)-minimal and u(<r)-maximal vertices of Qi are 
Si and tj, respectively. 

Let T 3 = Ui=i Pi be the corresponding partition of the hyperplanes given by this rule. For 
a hyperplane a, that is not parallel to any coordinate axis, denote by a(p) < (respectively, by 
o~(p) > 0) that a point p G R d lies in the closed halfspace bounded from above by a (resp., bounded 
from below by a). Observe that for i = 1, . . . 2 d ~ 1 , 

• a plane a £ Vi intersects the rectangular box Qi if and only if the o~(si) < < o~(ti). 

The minimum-perimeter objective is naturally expressed as a linear function. The resulting 
linear program has 2d variables Xi,yi,... ,Xd,yd for the box Qi = [x%, y\\ x . . . x [xd,yd], and 2n + d 
constraints. 



minimize — x j) (LPl) 



subject to 



o-(si) < 

<r(ti) > 
Xj < Vj 



if a G ^, VffGW 
if a G ^, Vcr G % 
Vj G {1, 
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Let Q* be a minimum-perimeter rectangular box intersecting %, with side lengths denoted by 
w\, . . . , Wd- To account for the error made by discretization, we need the following easy fact. The 
planar variant was shown in [14\ Lemma 2]. We include the almost identical proof for completeness. 

Lemma 1. There exists i £ {1, . . . , m} such that per(Qi) < (1 + s) per(Q*). 

Proof. Consider a box Qi, i £ {l,...,m}, that minimizes the angle difference /3 between the 
orientations of Qi and Q* . By construction, there exists i € {l,...,m} such that the angle /3 
between the orientations of Qi and Q* is at most e/(d — 1), that is, f3 < e/(d — 1). 

Let Q\ be the minimum-perimeter box with the same orientation as Qi such that Q\ contains Q* . 
By definition, per(Qj) < per(Q'j). An easy trigonometric calculation shows that the corresponding 
sides w[, . . . ,w' d of Q\ are bounded from above as follows. For j = 1 . . . , d, we have 

w'j < wj cos/3 + (y^Wfc) sin/3 < uij + (^w k ) - £ - . 

k^=j k^j 

Consequently, 

d d 

Y,w'j < (l + e)^ / w j , 

3=1 3=1 

that is, 

pev(Q'i) < (l + e)per(Q*). 
Since per(Qi) < per(Q^), it follows that per(Qi) < (1 + e) per(Q*), as required. □ 

Lemma 2. A closed curve 7 C M. d can be included in a rectangular box with side lengths w\, . . . ,Wd 
satisfying Y?j=i w j < ^T len (7)- 

Proof. Let 7 be a closed curve and let Q = Q{j) be a minimum-perimeter enclosing rectangular 
box. Assume for convenience that Q is axis-aligned, so that its extents in the d coordinates are 
wi, . . . ,Wd, respectively. Since Q has minimum perimeter, 7 meets each (d — l)-dimensional face 
of Q. Arbitrarily select a point Oj of 7 on each of the 2d faces of Q, in the order traversed by 
7, to obtain a polygonal closed curve 71 = (a±, . . . ,a,2d) still enclosed in Q (duplicate points are 
possible). We label the 2d faces accordingly by /1, . . . , fid such that dj G /j, for each i = 1, . . . , 2d. 

By the triangle inequality, len(7i) < len(7). Successively reflect the box Q with respect to the 
faces f'2, ■ ■ ■ , f2d- Let a[ be the final reflection of point a\ E 7. The segments composing 71 can 
be retrieved in the reflected boxes; they make up an open polygonal path 72 connecting a\ and a! x . 
By construction we have len(7i) = len(72). 

It is easily seen that /S.j{a\a l 1 ) = 2uij, for each dimension j = 1, . . . , d. By Pythagoras' Theorem, 

d d 
\aia[\ 2 = Aj(aia[) = 4 u>|. 
3=1 3=1 

Since the shortest distance between two points is a straight line, len(72) > |aia^ |. It follows that 

d 

len 2 ( 7 ) > len 2 ( 7 i) = len 2 (7 2 ) > \aia[\ 2 = 4^w 2 . (1) 

3=1 
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By the Cauchy-Schwarz inequality, dJ2j=i w ] > (J2j=i w j) 2 ' anci substituting this into (p} yields 

w (7) > 4 £ = $ (d£, «j) > \ fx: J . ( 2) 

3=1 \ 3=1 J \3=1 J 

Taking square roots yields Ylj=i w j — 2 ^ en (7)' as required. □ 

Let L* = len(OPT). Algorithm Al computes a tour T of length L = r(Qi) = w d + ^=i 2 d ~ j Wj, 

where Y?j=i w j < i 1 + e)^L*, by Lemmas CD and EJ By Observation [Q T is a valid tour of the 
hyperplanes in %. Note that 2 d ~ 1 of the 2 rf edges in the tour of length t(Q) have length w\. Since 
w 1 < W2 ■ ■ ■ < Wd, we have 



d 

l . 

Wl 



<-,Yw i < (l + e)^-L* = h+e)J—L*. 
3=1 



Consequently, 

d 

L = T(Qi) = w d + ^2 d -i Wj = 2 d - 1 w l + 2 d - 2 w 2 + . . . + 2w d _ 1 + 2w d 
3=1 

d ( d 

< 2 d - 2 Wl + 2 d ~ 2 wj = 2 d ~ 2 Iwi + ^2 wj 
3=1 \ 3=1 

In particular, for d = 3 and e < 0.00022, we have L < 2.31L*, thus algorithm Al computes a 
tour that is at most 2.31 longer than the optimal. The algorithm solves a (large!) constant number 
of 6-dimensional linear programs, each in 0(n) time [34] . The overall time is 0(n). A modest 
number of linear programs suffices to get a weaker approximation, say 2.5 or 3. 



Remark. A standard reduction |42j from the sorting problem, or from the convex hull problem, 
applied to a suitable set of vertical planes shows that a shortest TSP tour for n planes cannot 
be computed in 0(n) time; that is, in the worst-case, finding it requires Q(nlogn) time in the 
algebraic decision tree model of computation. 



3 TSPN for lines in R d 

In this section we prove Theorem Let £ = {£1, . . . ,£ n } be a set of n lines in M. d , d > 3. We 
reduce the TSPN problem to a group Steiner tree problem on a geometric graph. Specifically, we 
construct a geometric graph Gc = (V,E), where V is a set of points on the lines in C, and E 
consists of line segments connecting some of these points; the weight of an edge is its Euclidean 
length. We have V = UILi where Vi C t{ (i = 1, . . . , n) naturally form n groups, one for each 
line. We then run an approximation algorithm for the group Steiner tree problem on this graph. 

It is well known that an optimal TSP tour for points can be 2-approximated by a minimum 
spanning tree (a TSP tour is obtained by doubling the edges of the MST and by using shortcuts 
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and the triangle inequality). Reich and Widmayer |44j introduced the following group Steiner 
tree (a.k.a., one-of-a-set Steiner tree) problem. Given an edge weighted graph G = (V, E) and g 
groups of vertices V\, . . . ,V g C V, \V\ = n, find a tree in G of minimum weight which includes 
at least one vertex from each group. The problem is known to be APX-hard [5], and it cannot 
be approximated better than S7(log 2 ~ e n) for any e > unless NP admits quasipolynomial-time 
Las Vegas algorithms [26]. The current best approximation ratio, 0(log 2 ralogg) comes from the 
algorithm of Garg et al. [23] as further refined by Fakcharoenphol et al. [19J. As before with the 
MST, by doubling the edges of such a tree, and by using shortcuts and the triangle inequality, one 
can obtain a Hamiltonian cycle which includes at least one vertex from each group. Moreover the 
approximation ratio of this cycle is of the same order with the approximation ratio of the group 
Steiner tree used. 

We construct the graph Gc for a set C of n lines in M. d shortly. Lemma [5] below shows that the 
length of a minimum group Steiner tree in Gc is a constant-factor approximation for the minimum 
TSP tour for C. In our case, the graph Gc has 0(n 3 ) vertices and the number of groups is n, so 
the (9(log 2 n log ^-approximation \19\ 124] for the group Steiner tree problem on a graph with n 
vertices and g groups yields an 0(log 3 n)-approximation for TSPN with n lines in M d . 

Intuition. A transversal between two lines, £\ and £2, is a line segment P1P2 with p\ G £\ and 
P2 G 0.2- A minimum transversal of two lines is one of minimum length; is known to be orthogonal 
to both lines, and if the two lines intersect, it is a segment of zero length (i.e., p\ = p2). 

The graph Gc is defined in terms of a set S of transversal segments whose endpoints are on 
distinct lines from C. The vertices of Gc will be the set of endpoints of the segments in S; the edges 
of Gc will include all segments in S, and all segments along the lines in C between consecutive 
vertices. The set S will include the minimum transversal between all pairs of lines. We show 
(cf. Lemma [3]) that a segment PiPj, with pi G £i and pj G £j, can be approximated by the detour via 
the minimum transversal of £i and £j, if the angle between £{ and £j is bounded from below. The 
minimum transversals, however, are insufficient if several consecutive lines visited by an optimal 
tour are nearly parallel. For nearly parallel lines in C, we include many additional transversal 
segments in S, which lie in several parallel planes. It turns out (cf. Lemma H|) that it is sufficient 
to add extra connections in planes incident to the endpoints of minimum transversals among these 
lines. 

Two Technical Lemmas. The approximation relies on Lemmas [3] and 0] (below) . According to 
LemmaEl if the directions of two lines are far apart, then a connecting segment can be approximated 
by a 3-segment path that detours through the minimum transversal of the two lines. According 
to Lemma [4j if two lines are near vertical, then the near horizontal segments connecting them get 
shorter if their endpoints are closer to the minimum transversal. 

Lemma 3. Let l\ and £2 be two lines in ~R d such that the angle between their directions is ip G 
((po, |]. Let S1S2 be their minimum transversal with s\ G l\ and S2 G 12- Let pi G l\ and P2 G I2 be 



Proof. Consider the 3-dimensional affine subspace spanned by l\ and £2 ■ Without loss of generality, 
we may assume that £\ is the x-axis, p\ = (a, 0, 0), si = (0, 0, 0), and S2 = (0, 0, h) as in Fig. [2](left). 
Let a = \p\S\\ and b = \p2S2\- The Cauchy-Schwarz inequality yields the upper bound 



two points. Then \piSi\ + | Si S2 1 + \ S 2P2\ < 




+ \sis 2 \ + |s 2 pi| = a + h + b < y / 3(a 2 + b 2 + h 2 ). 



S 



Figure 2: Left: The angle between lines l\ and I2 is tp. The distance between p\ E l\ and P2 E i\ 
is approximated by the polygonal path (pi, s\, S2,pa) that passes through the minimum transversal S1S2 
between the two lines. Middle: If \piP2\ < gl^i^al) then p\ and P2 are much closer to the minimum 
transversal than ri and r 2 , respectively. Right: Two triangles with angle tp. The other two angles are equal 
in one triangle, and they differ by at most 2tp in the other triangle. 



If x{p2) > (as in Fig. [2j left), then the law of cosines yields 

1 2 = h 2 + a 2 + b 2 — 2ab cos tp 

= h 2 + (a - b) 2 costp + (a 2 + b 2 )(l - cost/?) 

> (l-cos(/?)(a 2 + 6 2 + /i 2 ) 

> (1 -cos(^o)(a 2 + & 2 + /i 2 )- 

If x(p 2 ) < 0, then |/?ip 2 | 2 = h 2 + a 2 + b 2 - 2a6cos(7r - tp) > h? + a 2 + b 2 , since cos(7r - tp) < 0, 
and we obtain |pij»2 1 2 > (1 — cos<y9o)( a2 + 6 2 + /i 2 ) in this case, as well. In both cases, the claimed 
inequality follows after taking square roots. □ 

Lemma 4. Let l\ and £2 be two lines in M. d such that the angle between their directions is tp £ (0, 
and the direction of each line differs from the Xd-axis by at most tp 6 [0, Let pi £ £\ and P2 £ £2 
be two arbitrary points on the two lines; let r\ 6 £\ and r 2 G £ 2 be their intersection points with 
a hyperplane orthogonal to the x^-axis; and S1S2 be their minimum transversal with si £ £\ and 
S2 S £2 (Fig. IE middle). If \piP2\ < jln^l, then \pisi\ < ^ |risi| and |p2«2 1 < ^ |r2S 2 |- 

Proof. Let h = \s\S2\ be the distance between the two lines. Put a = \p\si\, b = \p2S2\, e = |risi|, 
and / = I r 2 S2 1 • By the law of cosines, we have 1 2 = h 2 + a 2 + b 2 — 2abcostp. The sum of 
the last three terms in this expression is c 2 = a 2 + b 2 — 2abcostp, where c is the third side of a 
triangle with two adjacent sides of lengths a and b that meet at angle tp. Denote by (5 the angle of 
this triangle opposite to the longer of a and b. Then the law of sines yields a 2 + b 2 — 2ab cos tp = 
(max{a,6}) 2 • jj^a a > (max{a, b}) 2 sin 2 p. Consequently, 

IP1P2I 2 > h 2 + (max{a, b}) 2 sin 2 p. (3) 

Similarly, by the law of cosines we have |rir2| 2 = h 2 + e 2 + f 2 — 2ef cos p. Consider a triangle 
where e and / are adjacent sides parallel with r\ s\ and r2S2 respectively, that meet at angle p. Since 
the directions of l\ and £2 differ from vertical by at most tp, the angle opposite to the shorter of e 
and / is at least | — ip. Hence the law of sines yields e 2 + f 2 — 2ef cos tp < (min{e, f}) 2 • siri S(^/2-ip) ' 
and consequently 

\ ri r 2 \ 2 <h 2 + (min{e,/}) 2 • . 2 f* V (4) 

sin (7T/2 — ip) 
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The inequality |_pip2 1 < ^| r iT2| in combination with inequalities ([3]) and implies 
h 2 + (max{a, b}) 2 sin 2 ip <\(h 2 + (min{e, /}) 



<^ + I(min{eJ}) 2 -^_ ) (5) 

and further (after reducing the term h 2 and taking square roots) that 

min-f 6, f \ 

max{a,6) < ; — ^ — . (6) 

1 ; ~ 3sin(vr/2 - ip) w 

If V € [0, §], then sin (f - if)) > sin(f) = ^ and so max{a,6} < ^min{e,/}. It follows 
that a < and b < ^p/, as required. □ 

Construction of graph Gc- As noted above, Gc is given in terms of a set S of transversal 
segments among the lines: the vertices of Gc are the set of endpoints of the segments in S; the 
edges of Gc include all segments in S, and all segments along the lines in C between consecutive 
vertices. We use two types of segments, S\ and 52, with S = S\ U S%. Let S\ be the set of minimum 
transversals between all (!J) pairs of lines in C. The segments in S2 will connect families of "near 
parallel" lines in C. The definition of these families requires some preparation. 

The direction of a line in M rf is represented by a parallel line thorough the origin. The space 
of directions is the set of lines passing through the origin, also known as the Grassmann manifold 
Gr(l,d). It is a compact metric space, where the distance between two directions is the angle in 
[0, |] between the two lines passing through the origin. Since Gr(l, d) is compact, it can be covered 
by r = O(l) neighborhoods of radius 7r/12 (where r depends on the constant d > 3). Let B{Lj, y^), 
j = 1, ... ,r, be a cover of all directions with ^-neighborhoods for some Lj £ Gr(l,d). That is, 
Gr(l,d) = UJ=i-^(i> Tj)' ^ increasing the radii of these neighborhoods from ^ to ^, it is also 
clear that 

T 

Gr(M) = U 



For j = 1, . . . , r, let Cj C C denote the set of lines whose directions are in B(Lj, |). Observe that 
the angle between any two lines in a set Cj is at most ^, which is the diameter of B(Lj, ^). Note 
also that the sets Cj, j = 1, . . . ,r are not necessarily disjoint. In any case, we have 

r 

C={JCj. 

3=1 

For t > 1 and a set of points P in M. d , a t-spanner is a geometric graph G on the vertex set P 
such that the Euclidean length of the shortest path in G between any two vertices, p and q, is at 
most t\pq\. For a set of n points in M. d and a constant t > 1, a bounded-degree t-spanner can be 
computed in 0(nlog d ~ 1 n) time |40j . For d = 2, the Delaunay triangulation is a 1.998-spanner [51] 
with 0(n) edges, and it can be computed in 0(n log n) time. 

We now define the segments in S2] see Fig. [3jleft). For each endpoint p of each segment in 
Si, let £(p) £ C denote the line containing p. For every such segment endpoint p, and for every 
j = 1,... ,r such that i{p) G £j, let Hj(p) denote the hyperplane orthogonal to Lj and containing 
p, and let P(p,j) = {Hj(p) n£:££ Cj} be the set of intersection points of Hj(p) with the lines in 
Cj. Construct a 2-spanner for the n points in P(p,j) C Hj(p), and add all edges of the 2-spanner 
to the set S^- This completes the description of the segments in S2. 
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S = S\ U S2 determines Gc- The segments in Si have n(n — 1) endpoints, and for each 
endpoint we compute no more than r = O(l) 2-spanners, each with 0(n) vertices and edges. So 
S2 contains 0(n 3 ) segments and Gc has 0(n 3 ) vertices and edges. 




Figure 3: Left: A set of four lines £\, . . . , £4, whose directions are within ir/6 from the vertical direction dj. 
The minimum transversals between £4 and the other three lines are P191, P2Q2 and P393. For i = 1,2, 3, we 
insert a 2-spanner in the horizontal planes Hi(pi) containing point p t . (2-spanners in other parallel planes 
are not shown in this figure.) Right: An optimal tour OPT visits the lines £\,£2, £3, £4 at points Pi,P2,P3,P4, 
respectively. We construct a path 71 = (gi 929394) that visits these lines in the same order. Points gi,g2,93 
are in a horizontal plane Hi = H 2 = H 3 . Point 94 is in a plane H4 7^ i? 3 because |g3r4| > 3 |_P3P4 1 . 



Group Steiner tree yields a constant-factor approximation for TSP with lines. We 

prove the following. 

Lemma 5. Let C be a set of n lines in M 3 . Then the length of a minimum group Steiner tree in 
Gc is O(l) times the length of a minimum TSP tour for the lines in C. 

Proof. Let C = {£1, . . . , £ n }, where the lines are indexed so that an optimal TSP tour is OPT(£) = 
{pi,... ,p n ) with pi G li, i = 1, . . . , n. We show that Gc contains a group Steiner tree T of length 
at most 68 len(OPT(£)). (The proof does not use the optimality of the tour OPT(£), i.e., for any 
cycle C = {pi, . . . ,p n ), Pi £ 1%, we construct a group Steiner tree of length at most 681en(C).) 
The tree T will consist of a main (backbone) path 70, and a path attached to each vertex of the 
backbone. 

Decompose the sequence {li,... ,£ n ) into maximal subsequences (which we call blocks) 

(^r(i))^r(i)+l' ■ ■ ■ > i = 1, 2, . . . , 

for some k as follows. Let r(l) = 1, and for each i = 1, . . . , k — 1, let r(i + 1) be the first index such 
that the directions of and ^ T (i+i) differ by more than That is, the directions of the lines in 
the i-th block differ from the direction of £ T ^ by at most j^. This implies that if line £ T ^ is in the 
neighborhood B(Lj, ^) C Gr(l,d), then all lines in the the i-th block are in B(Lj, ?) C Gr(l,d). 

Consider the sequence of the first elements of the blocks, {i T m , £ T (2) , • • • ,K(k))- By construc- 
tion, the directions of any two consecutive lines differ by more than j^. For i = l,...,k — 
1, let s T u\t T n + i\ denote the minimum transversal between & T u\ and ^ r (j+i) with s T u\ £ l T u\ 
and t r (j + i) G The "backbone" of the group Steiner tree T is the polygonal path 70 = 
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(s r ( 1 )t r (2) s r(2)^r(3) ■ ■ ■ ■ s r(fc-i)^r(fc))- By LemmaO len(7o) is bounded from above by 9.41en(OPT(£)): 
len(7o) = len(s r(1) t r(2) s r(2) t r(3) ...s r(fc _ 1) t r(fc) ) 

< len(p T (i)S T (i)t r (2)P T (2)) + • • • + len(p r ( fc _ 1) s r(fc _ 1 )t T ( A .)p T ( fe )) 

< ]J I _ J (7T/12) MPr(l)ft-pi) . . . p Tik) ) 

< 9.4 len(pip 2 ■ ■ ■ Vn) 

< 9.4 OPT(£). (7) 

For each block {£ T u) , £ T (i)+i i • • • >A-(i+i)-l)> i = 1, . . . , A;, we attach a path 7^ to the backbone 
7o- We only explain this procedure for the first block; other blocks are treated analogously. Put 
m = t(2) — 1, so the first block is {£\,£ii ■ ■ ■ ,l m ) (recall that r(l) = 1). The directions of the 
lines in a block are in a ^-neighborhood B(Lj, ^), for some direction Lj E Gr(l,d). Without loss 
of generality, we may assume that j = 1 and L\ is parallel to the x^-axis, that is, the directions 
of l\ , . . . , t m differ from the x^-axis by at most 5 . For brevity, a segment parallel to the x^-axis 
is called vertical, and a segment or hyperplane orthogonal to the x^-axis is called horizontal. We 
construct a path 71 in Gc that starts from si and includes some point qi E li for each % = 1, . . . , m, 
in this order. The path 71 will use horizontal transversal segments from S2 between lines in C, and 
possibly other edges along the lines £%,... ,£ m . 

We construct 71 incrementally. Refer to Fig. [3] (right). Initially, it is a single- vertex path 
7i = (<7i)> where q\ = s\ E l\. We extend 7, in m — 1 steps to visit points qi E £i, i = 2, . . . ,m. 
In step i, ideally we would like to find a point E ^+1 in the horizontal hyperplane containing 
gj. However, if the distance from % to li+\ in this horizontal hyperplane is larger than 3 
then 71 will be continued in the horizontal hyperplane that contains one of the endpoints of the 
minimal transversal between li and ^4.1 . 

Suppose that we have already built the path 71 up to vertex % E £{, i E {1, . . . ,m — 1}. We 
choose qi + i E li+i, and the portion of 71 from qi to qi + \ as follows. Let Hi be the horizontal 
hyperplane containing Let r^+i = n -ffj (by construction, rj+i is a vertex of Gx). We 
distinguish between two cases: 

• If jgjrj+il < 3 then let qi + \ = ty+x, and extend the path 71 by the shortest path from 
qi to qi + x in the 2-spanner in Hi. 

• Otherwise denote the minimum transversal between £j and li+i by SjSj+i with Sj G £j and 
Sj E Let ilj+i be the horizontal hyperplane containing Si + \ £ £i+i, and let m = ^fl-ffj+i. 
Now let Qj+i = and extend 71 by a path from to Ui E Hi+i along £j, and by the shortest 
path from m to gj+i in the 2-spanner in the hyperplane Hi + \. 

For estimating len(7i), we consider the transversal segments and the edges along the lines in 
C separately. All transversal segments in 71 are horizontal. The total length of these transver- 
sal segments between qi and qi + \ is at most 6|piPi+i|, and consequently, the total length of all 
transversal segments in 71 is at most 61en(pi . . .p m ). Indeed, in the first case the 2-spanner in Hi 
contains a path between and qi + \ of length at most 2 l^ift+il = 2 l^ri+ij < 6 |piPi+i|. In the 
second case, the 2-spanner in i^+i contains a path between m = £i D -ffj+i and qi + \ of length at 
most 2 \uiq i+1 \ = 2 |ujS i+ i| < cos( ^ /6) \si§i+l\ = 4j \s{Si+l\ < -75 \PiPi+l\, where the inequality 
l^iSj+il < cos(tt/6) foh° w s from the fact that the direction of li differs from vertical by at 

most ?, and so the right triangle msiSi+i has an interior angle at most ? at Sj+i. 

It remains to bound the total length of the edges in 71 that lie along the lines £±, . . . ,£ m . 
Recall that the directions of the lines £i,...,£ m differ by at most 5 from vertical. Therefore, it is 
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enough to estimate the vertical extents of the edges along these lines, which approximate the actual 
lengths up to a factor of cos (^y 6 ) = ^ • Denoting the total vertical length of the segments in 71 by 

Z = ~YX=i \ z (li) ~ z (li+i)\i the total length of the edges in 71 that lie along the lines t\, . . . ,£ m is 
at most A=Z. 

If qi and qi + \ are in the same horizontal plane, then obviously z(qi) = z{qi + \). Otherwise, 
we have |pijpj+i| < by construction. In this case, Lemma [4] is applicable, and it gives 

|pi + iSj + i| < ^^|rj + iSj + i|. The segments Pi+iSj+i and rj + iSj + i are collinear, hence the inequality 

holds for their projections to the a^-axis, that is, \z(pi+i) — z(ii+i)\ < ^^\z(ri+i) — 
Substituting s^+i = qi+i, we obtain 

2\/3 

\ z (Pi+i) ~ z (Qi+i)\ < —^-\ z (n) - z (n+i)\^ ( 8 ) 

for every i £ {1, ... ,m — 1} where z{q{) 7^ z{qi + \). Inequality ([8]) holds only for the indices i E 
{1, . . . , m— 1} where the segment (Mi+l °f 7l connects two different horizontal hyperplanes. To mark 
these indices, we introduce the following notation. Let <r(l) = 1 and let 1 < a(2) < . . . < a(t) < m 
be the sequence of indices such that z(q a u\i) / z(q a /{\) for i = 2,...,t. With this notation, 
we have z(q a{i) ) = z(q a(ji+1) _ l ) for i = 1, . . . , t - 1, and hence Z = ^*=2 \ z (Qa(i)-i) ~ z {<la(i))\ = 
Y^h=i \ z (la(i)) ~ z (.Qa(i+l))\- Then Z is bounded from above by: 

t-l 

Z = 53l*(9«7(t))-*(9«r(i+l))l 

1=1 

t-l 

< Yl (\ Z (^(i)) ~ Z (P*(i))\ + \ Z (P<r(i)) ~ z (Pa(i+l))\ + \ Z (Pa(i+l)) - z (Qa(i+l))\) 
i=l 

t-l t-l 

< \ z (qa(l)) -z(Pa(l))\ + J^k(P«r(i)) ~ z (P*(i+l))\ + 2 X1 I Z (P^+l) ) ~ z {la(i+l))\ 

i=l i=l 

< - + J] \ Z (P*(i)) ~ z (Pv(i+l))\ + l Z (^( 4 +l)-l) - «(ftr(i+l))l 

i=l i=l 



m 4\/3 
< |z(si) - z(pt)\ + ^ |«(pi) - z(p i+1 )\ + — - Z, 

4 = 1 



where we used the triangle inequality, inequality (JSJ), and the fact that gi = s\. After rearranging, 
we obtain 



Z < 



< 



9 X 



-7-77= ^N(ai) - *(pi)| + ~ z ( pi+1 



9-4^/3 



rn—l 

\ S lPl\ + l^+il 

i=l 



< 9 _ 4 ^/g (9-4|y>ip m+ i| + len(pip 2 ■■■Pm)) 
9 

- 9 _ 4^/3 ' 10 - 41en (PlP2 ■ ■■Pm+l), 
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where we used < len(pisit m+ ip m+ i) < 9A\pip m+ i\ from ([7]). It follows that 

len( 7l ) < U + A . q _ 9 4 ^_ • 10.4^ len(p T(1) . . . Pr{2) ). (9) 

Since this bound holds for each block, we have thereby shown (according to ([7|) and ([9])) that Gc 
contains a group Steiner tree for C whose length is 

k ( 2 9 \ 

len(7o) + Vlen( 7i ) < 9.4 + 6 + —= = • 10.4 len(OPT(£)) < 681en(OPT(£)), 

~{ V v3 9-4V3 / 

as claimed. □ 

4 TSPN for unit disks and balls 

Without a doubt, among the simplest neighborhoods are congruent disks [21 [T5] . TSPN for unit 
disks is NP-hard, since when the disk centers are fixed and the radius tends to zero, the problem 
reduces to a TSP for points. Given a set S of n points in the plane, let T> = T>(S, r) be the set of n 
disks of radius r centered at the points. It is known (and easy to argue) that the optimal tours for 
the points and the disks, respectively, are polygonal tours with at most n sides. The lengths of the 
optimal tours for the points and the disks are not too far from each other. Indeed, given any tour 
of the n disks, one can convert it into a tour of the n centers by adding detours of length at most 
2r at each of the n visiting points (arbitrarily selected); see e.g., [T5l [2T] . Let OPT(£) denote a 
shortest TSP tour of S, and OPT(S, r) denote a shortest TSP tour of the disks of radius r centered 
at the points in S. Consequently, for each n > 3 and r > 0, we have: 

len(OPT(5)) - len(OPT(S, r)) < 2 nr. (10) 



The first PTAS for TSPN with unit disks was obtained in [15J. As it is the case with TSP 
for points, the known approximation schemes are highly impracticable; see the comments in |33j 
and [281 Ch. 13-14]. This is even more the case for the approximation schemes for TSPN, including 
disks. Designing more practical (efficient) algorithms with a constant approximation factor has 
remained of increased interest. The obvious motivation is to provide faster and conceptually simpler 
algorithmic solutions. 



4.1 Unit disks: an improved approximation 

Background. The current best approximation ratio for the TSP with n unit disks, 7.62, was 
obtained in [15J. The algorithm works by reducing the problem for n disks to one for at most 
n (representative) points (representative points could be shared). These points are selected after 
computing a line cover consisting of parallel lines. More generally, this ratio holds for translates of 
a convex region. An alternative approach (also from [15]) selects representative points from among 
the centers of the disks (i.e., a suitable subset). However, the approximation obtained in [15] in 
this way is weaker. For instance, starting from a (1 + e)-approximation for the center points yields 
a ratio of (8 + 7r)(l + e) < 11.16, provided that e < 0.001. Starting from a 3/2- approximation (with 
a faster algorithm) for the center points yields a ratio of (8 + 7r)1.5 < 16.72. 

Here we improve the two asymptotic approximation ratios, from 7.62 to 6.75 (when using the 
PTAS for points), and from 11.43 to 8.52 (when using the faster 1.5- approximation for points). 
Somewhat surprisingly, we employ the latter approach with center points, which gave previously 
only a weaker bound. It is worth mentioning that the ratios for the special case of disjoint unit 
disks remain unchanged, at 3.55 and 5.32, respectively. We now proceed with the details. 
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A simple packing argument. Let B(x) denote a ball of radius x centered at the origin. Let 
G = (V, E) be a connected geometric graph in M 2 and let L = len(G). Let C be the set of points at 
distance at most x from the edges and vertices of G. Equivalently, C = G + B(x) is the Minkowski 
sum of G and B(x). We need the following inequality. 

Lemma 6. Area(C) < 2Lx + irx 2 . This bound cannot be improved. 

Proof. Consider B(x) + vq for an arbitrary vertex vq of G and mark vertex vq. The area covered so 
far is -kx 2 . Pick an edge uv of G where u is marked and v is unmarked. Place B(x) with the center 
at u and translate B(x) along uv (its center moves from u to v), and mark v. Observe that the 
newly covered area is at most 2\uv \x. Continue and repeat this step as long as there are unmarked 
vertices. Since G is connected the procedure will terminate when all vertices of G are marked. It 
follows that the area of C is at most 

9 X ^ i i 9 

ttx + y, 2\uv\x = 2Lx + TTX , 
uveE(G) 

as required. 

Equality holds if and only if G is a straight-line path. Indeed, except for the first step (i.e., 
each step involving an edge) the newly covered area is strictly less than 2|uu|x, unless all edges of 
G are collinear in a straight-line path. □ 




Figure 4: From left to right: (i) a line-sweep independent set (in bold lines); (ii) the curve 7; (hi) a part of 
the constructed disk tour. 



Approximation algorithm — outline. As in [15], the general idea is to first compute a max- 
imal independent set and then an approximate tour of the centers of the independent set. The 
approximate tour of the centers is then modified (extended) so that it visits all the other disks (not 
in the independent set). However the details differ significantly in both phases of the algorithm, in 
order to obtain a better approximation ratio: a monotone independent set is found, and a tailored 
visiting procedure is employed that takes advantage of the special form of the independent set. 

Let P be a set of unit disks. First, compute a maximal independent set of disks IcPby the 
following line-sweep algorithm. Select a leftmost disk u) € T> and include it in X. Remove from T> 
all disks intersecting u. Repeat this selection step as long as T> is non-empty. 

We call X a line-sweep independent set or x-monotone independent set. Clearly, X is a maximal 
independent set in T>, that is, each disk in T> \ X intersects a disk in X. Moreover, by construction, 
each disk in D \ I intersects the right half-circle boundary of a disk in X. 

Let L* = len(OPT(£>)) and L* x = len(OPT(X)). Obviously, L* x < L* . 
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Algorithm. The algorithm for computing a TSP tour of the disks is as follows. Compute a 
(maximal) line-sweep independent set X; write k = |X|. Next, compute Tj = oi...Ok, an a- 
approximate tour of the center points of disks in X, for some constant a > 1. If we use the PTAS 
for Euclidean TSP [HIES], for a given < < e < 1/2, we have a = 1+e. If we use the approximation 
algorithm for metric TSP due to Christofides [10] , we have a = 1.5. 

Write Si = {o\, 0%, ■ ■ ■ , Ok}- For each disk to El, let to~ and to + be the two unit disks tangent 
to to from below and from above, respectively. Let o~ and o + be the centers of to~ and to + , 
respectively. See Fig. E|h). Let 7(0;) be the open curve obtained as follows: start with the tangent 
segment of positive slope from tow; concatenate the arc of to subtending a center angle of ir/3 
and symmetric about the x-axis; concatenate the tangent segment of negative slope from w to o + . 
Now remove two unit segments, one from each endpoint of the curve obtained in the previous step. 
The resulting curve is 7 = j(to). Observe that the open curve j(oo) intersects any unit disk from 
T> that intersects the right half-circle boundary of to (this includes 10 as well). Let v denote the 
vertical segment connecting the endpoints of 7. It is easy to check that 

len(7) = 2(! + 2cos^-l) = 2^ + \/3-l) < 2.512, 

len(v) = 4 - y/3 < 2.268. (11) 

Replace each segment OjO.; + i of this tour, with i odd, by a parallel segment of equal length 
connecting the two highest endpoints of the curves j(tOi) and 7(0^+1). Similarly, replace each 
segment OjOj+i of this tour, with i even, by a parallel segment of equal length connecting the two 
lowest endpoints of 7(0;^) and 7(^+1). See Fig. E^iii). 

To obtain a tour (closed curve) we visit the disks in I in the same order as T%. After each 
segment, the tour traverses the corresponding curve 7(0;) (going up or down, as needed, in an 
alternating fashion). If k is even we proceed as above, while if k is odd, the curve 7(^1) is traversed 
in a circular way (going down along 7 and up again along the vertical segment v) in order to get a 
closed curve. We call T the resulting tour. 



Algorithm analysis. Observe that T is a valid tour for the disks in D. Further observe that the 
disjoint unit disks in I are contained in the figure C = T£ + B(2). By Lemma[6l 

ir\l\ < Area(C) < 41en(T|) + 4vr, 

hence 

k = \X\ < —LX- + 4 < —L* + 4. (12) 

The total length of the detours incurred by T over all disks in I is A;len(7) + len(-u), hence 
by (jlip the length of the output tour is bounded from above as follows. 

L < L 5l + (2.512A; + 2.268). (13) 

Inequality (|12p implies the following upper bound on the second term in (|13p . 



2.512& + 2.268 < 2.512 ( —L* + 4 ) + 2.268. (14) 



TT 



We next bound from above the first term in (I13h . The inequality ()10p applied to X and S% 
yields 

L* Sx <L* I + 2k. (15) 
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Since the algorithm computes a a- approximation of the optimal tour for the points in Sx, 
by (fT2l) we have 



Ls x < aL* Sl < a{L* x + 2k) < a{L* + 2k) 
<a(l* + 2 (J^L* + 4 

<a((l + ^\L* + sj. (16) 

Substituting into (JT3J) the upper bounds in (fl~6|) and (fT4"|) yields 

L < a ^1 + ^ L* + 8^ + 2.512 (J^-L* + 4j + 2.268 

< (a (l + ^\ + 2.512 • ^ L* + (8a + 4 • 2.512 + 2.268) 

< (3.5465a + 3.1984)L* + (8a + 12.32). (17) 

For a = 1 + e (using the PTAS for the center points), the length of the output tour is L < 
6.75 L* + 20.4, assuming that e < 0.001. A more precise calculation along the lines above yields 
the following upper on the main term (in L*); the constant factor appears in Theorem [3j note also 
that 1/0.53 > 1.8, which explains the other parameter in Theorem[3l 

(l + 0.53e)L*. 




The running time is dominated by that of computing a (1 + ^-approximation of the optimal tour 
of n points in M 2 . 

For a = 1.5 (using the algorithm of Christofides for the center points), the length of the output 
tour is L < 8.52 L* + 24.4. The running time is dominated by that of computing a minimum-length 
perfect matching on n points in the plane (n even), e.g., 0(r?l 2 log 5 n) by using the algorithm of 
Varadarajan [50J. 



Remark. If the input consists of pairwise-disjoint (unit) disks, then (|16p yields improved ap- 
proximations. These are not new: the case a = 1 + e was already analyzed in [15]; we just list 
them for comparison. For a = 1 + e, (|16p yields L < 3.55 L* + 8.01, assuming that e < 0.001. 
For a = 1.5, (|16p yields L < 5.32 L* + 12. The approximation ratio 3.55 for disjoint unit disks is 
probably far from tight; the current best lower bound is 2, see [15]. The example in [27, Fig. 4] is 
yet another instance with a ratio (lower bound) of 2. Hence the approximation ratio 6.75 for unit 
disks (which uses the above) is probably also far from tight. 



Remarks. 1. A simple example shows that one cannot extend the above approach to disks of 
arbitrary radii. Let x > 1. See Fig. [5{left) where n = 3, and Fig. [5] (right) for its analogue with 
arbitrarily large n. Let x — > oo and e — > 0. 

(i) Suppose that we first compute a maximal independent set I in a greedy manner, by se- 
lecting disks in increasing order of their radii. Further suppose that we start by computing T, 
a constant approximation for the shortest TSP tour on X, for instance by using the algorithm 
of de Berg et al. [H] (recall, this algorithm works with fat, disjoint regions). In some instances, 
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no constant factor extension (by adding suitable detours to visit the remaining disks) exists. In 
Fig. Epeft), len(OPT(X)) = 2s, while len(OPT) = Ax. Moreover, since x — > oo, no asymptotic 
constant factor can be guaranteed by this approach; indeed, for any constants a, f3, there exists x 
large enough, such that a 2s + /3 < Ax. 




£ 



Figure 5: A set of three disks of radii 1, X, and x, centered at 0, 1 + x + e and 1 + 3a; (left) and a set of n 
disks, n > 3, of radii 1 , . . . , f , x and x (right). A maximal independent set of disks (in bold) is shown for 
each case. 

(ii) Suppose that we first compute a maximal line-sweep independent set, as in our algorithm 
for unit disks. The same example depicted in Fig. Odeft) shows that no constant factor extension 
(by adding suitable detours to visit the remaining disks) exists. Moreover, as in (i), since x — > oo, 
no asymptotic constant factor can be guaranteed by this approach. 

2. Consider an algorithm which hrst computes a maximal independent set I of disks (according 
to some criterion), then computes a good approximate tour of the disks in I, and then extends 
this tour with the boundary circles of the disks in I (in some way). Observe that the length of the 
overall detour incurred in this way is proportional to ^iex r * - ^he following claim (and example) 
shows a deeper cause for which this general approach does not give a constant approximation ratio. 

Claim. For every M > 0, there exists a disk packing in the unit square [0, l] 2 with > M 
and all disks tangent to the unit segment [0, 1] x [0,0]. 

Proof. We place disks in layers of decreasing radius. Each layer consists of congruent disks placed 
in blocks in between consecutive tangent disks of the previous layer, or in between a disk and a 
vertical side, as in Fig. [U The first layer consists of k disks of radius l/(2/c), for some k > 1. By 




Figure 6: The first two layers of an iterative construction: k — 1 (left), and k — 2 (right). 

choosing the radius of the disks in the next layer much smaller than the radius of the disks in the 
current layer, one can "cover" any prescribed large fraction p < 1 of the length of the bottom side 
of the square by disks tangent to the bottom side of the square and having the sum of radii at least 
p/2. Consequently, by using sufficiently many layers, one can achieve ^ r « — ^\ as required. □ 

4.2 Unit balls in R 3 : an improved approximation 

We need the following analogue of Lemma [6l The proof method is the same. Let B(x) denote 
a ball of radius x. Let G = (V, E) be a connected geometric graph in R 3 and let L = len(G). 
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Let C be the set of points at distance at most x from the edges and vertices of G. Equivalently, 
C = G + B{x) is the Minkowski sum of G and B{x). 



Lemma 7. Vol(C) < vrx 2 L + ^-x 3 . 



This bound cannot be improved. 



Let P be a set of unit balls (as input). As in the planar case, we compute a maximal independent 
set of disks X C T> by a plane-sweep algorithm. For convenience, we sweep a horizontal plane in the 
positive direction of the z-axis. We call X a plane-sweep independent set or z-monotone independent 
set. 

The algorithm computes a tour of T> as follows. First, compute a maximal z-monotone inde- 
pendent set X; write k = \T\. Next, compute T% = o\ . . . o^, an a-approximate tour of the center 
points of the balls in X, for some constant a > 1. Write S% = {01, 02, . . . , Of.}. For each ball uGl, 
let r = r(u;) be a discrete set of 28 lattice points associated with 00 (relative to its center). For 
describing this set we will assume for convenience that the center of co is (0,0,0). Let a = l/y/3. 
r contains 16 points in the plane z = a and 12 points in the plane z = 3a; see Fig. HJii). Specifically, 

r = {(—3a, —3a, a), (—3a, —a, a), (—3a, a, a), (—3a, 3a, a), 
(—a, —3a, a), (—a, —a, a), (—a, a, a), (—a, 3a, a), 
(a, —3a, a), (a, —a, a), (a, a, a), (a, 3a, a), 
(3a, —3a, a), (3a, —a, a), (3a, a, a), (3a, 3a, a)} 

U {(—3a, —a, 3a), (—3a, a, 3a), (—a, —3a, 3a), (—a, —a, 3a), (—a, a, 3a), (—a, 3a, 3a), 
(a, —3a, 3a), (a, —a, 3a), (a, a, 3a), (a, 3a, 3a), (3a, —a, 3a), (3a, a, 3a)}. 

One can check that the points in V admit a Hamiltonian path in which each edge has length 
2a, say £(r) = 71, 72, • • • , 728) starting at 71 = (—a, —3a, a) and ending at 728 = (—a, —3a, 3a). 



3a 
a 

—a 
-3a 



• • • • 



-O • • £>■ 



-3a —a a 3a 



Figure 7: The set T has 16 points with z = a and 12 points with z = 3a; |T| = 28. The hollow circles indicate 
the four missing points in the plane z = 3a. 

We will prove shortly that any unit ball that intersects u from above (i.e., the z-coordinate of 
its center is non-negative) contains at least one of the points in T(uj). Moreover, this also holds for 
ui itself. 

We modify (extend) the tour T% = o\ . . . o& as follows. Assume first that k is even. We replace 
each segment OjOj + i of this tour, with i odd, by a parallel segment of equal length connecting 
71 € T{uJi) with 71 G T(uji + i). Similarly, we replace each segment OjOj + i of this tour, with i even, 
by a parallel segment of equal length connecting 728 G r(wj) with 728 £ Y{oji + \). To obtain a 
tour, we visit the balls in X in the same order as T%. After each segment, the tour visits all the 
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28 points in the corresponding set T(u>) by using the Hamiltonian path £(r) and then continues 
with the next segment, etc. This extension procedure can be adapted to work for odd k without 
incurring any increase in cost: specifically, the first cycle of period 2 is replaced by a cycle of period 
3. For odd k, the output TSP tour has the form T = ^i^2^3^ R ^ R ■ ■ ■ ££ R , rather than the form 
T = ££^£s • • • CC R (f° r k even). Here £ R is the path £ traversed in the opposite direction, and 
£i ,£2 >£3 are three suitable Hamiltonian paths on T (details are omitted). 

Algorithm analysis. The analysis of the approximation ratio is similar to that in the planar 
case. The disjoint unit balls in I are contained in the body C = T£ + .6(2). By Lemma[71 

< Vol(C) < 47rlen(T|) + ^8, 

hence 



k = \X\ < 3 [L* + 3 J = 3L * + 8 - ( 18 ) 

The total length of the detours incurred by T over all balls in X is bounded from above by 

2 

(28 - l)2oJfe = 27^=k = 18\/3fc. (19) 
v3 

It follows that the length of the output tour is bounded from above as follows. 

L<L Sx + 18VSk. (20) 

The upper bound on Ls x (analogue of ([TBI) ) is 

L Sl < aL* Sx < a{L* x + 2k) < a(L* + 2k) < a{L* + 2(3L* + 8)) 

= 7aL* + 16a. (21) 

The upper bound on 18\/3fc (analogue of (fill) ) is 

18\/3fc < 18\/3(3L* + 8) = 54\/3L* + 144\/3. (22) 

Substituting into (|20"1) the upper bounds in (J2TT) and (|22|) yields 

L < (7aL* + 16a) + (54\/3L* + 144\/3) 

= (7a + 54^/3)^* + (16a + 144^3). (23) 

For a = 1 + e (using the PTAS for the center points), the length of the output tour is L < 
100.61 L* + 265.6, assuming that e < 0.01. For a = 1.5 (using the algorithm of Christofides for 
the center points), the length of the output tour is L < 104.1 L* + 273.5. The running time is 
dominated by that of computing a minimum- length perfect matching on n points in R 3 (n even), 
e.g., 0(n 3 ) [21]. 

Lemma 8. Let u and u' be two intersecting unit balls, centered at (0, 0, 0) and (x, y, z), respectively, 
where z > 0. Then u contains a point in T(uj). 
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Proof. By symmetry, it suffices to prove the claim when x,y > 0. We therefore have x,y,z > 
and x 2 + y 2 + z 2 < 4. We distinguish two cases, depending on whether z < 2a or z > 2a. If z < 2a, 
we show that u; contains a point of T in the lower plane a\ : z = a; if z > 2a, we show that oj 
contains a point of V in the higher plane a% : z = 3a. Write Y\ = T C\ o~\, and r3 = T n 03. 

Case 1: z < 2a. Since x 2 + y 2 + z 2 < 4, we have max(x,y) < 2 < 4a. The closest lattice point 
7 = (lx,l y ,7z) £ Ti to (x,y,z) satisfies 

|» — 7aj|<o, |y-7 y |<a, and \z - j z \ < a, 

thus 

(a - Ixf + {y~ ly? + {z~ Iz? < 3a 2 = 1, 

as required. 

Case 2: z > 2a. Since x 2 + y 2 + z 2 < 4, we have x 2 + y 2 < 4 — 4a 2 = 8/3. Observe that the 
disk x 2 + y 2 < 8/3 does not intersect the interior of the square [2a, 3a] 2 in the plane z = 0. Thus 
the projection of (x, y, z) onto the plane z = is contained in [0, 3a] 2 \ (2a, 3a] 2 . This implies that 
the closest lattice point 7 = ^) X ily,lz) 6 ^3 to (x,y,z) satisfies 

\x-j x \<a, \y-j y \<a, and \z - j z \ < a, 

and the conclusion follows as in Case 1. □ 

Remark. Analogous to the planar case, if the input consists of pairwise-disjoint (unit) balls, 
then (|21|) yields improved approximations. For a = 1 + e, (|21|) yields L < 7.01 L* + 16.1, assuming 
that e < 0.001. For a = 1.5, (HU) yields L < 10.5 L* + 24. 

5 Conclusion 

We have revisited TSP with neighborhoods and have obtained several approximation algorithms: 
some for neighborhoods previously less studied (lines and hyperplanes in M. d ), and some for the 
most previously studied (such as disks and balls). Despite the progress, one may rightfully say that 
the general problem of TSP with neighborhoods is far from resolved. Interesting questions remain 
open regarding the structure of optimal TSP tours for lines, segments, balls, and hyperplanes, and 
the degree of approximation achievable for these problems. We list the simplest and most natural 
questions (still unsolved) that we could identify. 

(1) Is there a polynomial-time algorithm for planes in M 3 ? 

(2) Is there a constant approximation algorithm for lines in R 3 (or in R rf for d > 3)? Can the 
current 0(log 3 n) ratio be improved? 

(3) Can the approximation factors for unit disks (3.55 for disjoint, 6.75 for arbitrary) and balls 
(7.01 for disjoint, 100.61 for arbitrary) be further reduced while maintaining (or reducing) 
the running times? 

(4) Is there a constant approximation algorithm for disks (of arbitrary radii) in the plane? Is 
there one for balls (of arbitrary radii) in M 3 ? 

(5) Is there a constant approximation algorithm for parallel segments in M 3 ? (To start with, one 
can further assume that the segments are pairwise-disjoint.) 
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